Big Data Analytics in Genomics by Ka-Chun Wong

Big Data Analytics in Genomics by Ka-Chun Wong

Author:Ka-Chun Wong
Language: eng
Format: epub, pdf
Publisher: Springer International Publishing, Cham


2.1.3 Integration of Data Sources in Whole-Sequence Annotation Transfer

Another way to improve whole-sequence methods is to integrate additional data sources. The GOtcha method in [236], for instance, organizes the annotations of sequences similar to a query into a set of GO-like directed acyclic graphs. A P-score is calculated based on the frequency of occurrence of respective annotations and BLAST E-values of the corresponding matches. The P-score estimates the confidence attached to the annotation of the query sequence with that term, and a threshold value for the P-score allows extracting a final set of annotations. Evaluation of this approach on the Drosophila melanogaster genome showed that the results were more sensitive and specific than those obtained with the baseline approach.



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.